Comparative analysis of delivered and planned doses in target volumes for lung stereotactic ablative radiotherapy

Background Adaptive therapy has been enormously improved based on the art of generating adaptive computed tomography (ACT) from planning CT (PCT) and the on-board image used for the patient setup. Exploiting the ACT, this study evaluated the dose delivered to patients with non-small-cell lung cancer (NSCLC) patients treated with stereotactic ablative radiotherapy (SABR) and derived relationship between the delivered dose and the parameters obtained through the evaluation procedure. Methods SABR treatment records of 72 patients with NSCLC who were prescribed a dose of 60 Gy (Dprescribed) to the 95% volume of the planning target volume (PTV) in four fractions were analysed in this retrospective study; 288 ACTs were generated by rigid and deformable registration of a PCT to a cone-beam computed tomography (CBCT) per fraction. Each ACT was sent to the treatment planning system (TPS) and treated as an individual PCT to calculate the dose. Delivered dose to a patient was estimated by averaging four doses calculated from four ACTs per treatment. Through the process, each ACT provided the geometric parameters, such as mean displacement of the deformed PTV voxels (Warpmean) and Dice similarity coefficient (DSC) from deformation vector field, and dosimetric parameters, e.g. difference of homogeneity index (ΔHI, HI defined as (D2%-D98%)/Dprescribed*100) and mean delivered dose to the PTV (Dmean), obtained from the dose statistics in the TPS. Those parameters were analyzed using multiple linear regression and one-way-ANOVA of SPSS® (version 27). Results The prescribed dose was confirmed to be fully delivered to internal target volume (ITV) within maximum difference of 1%, and the difference between the planned and delivered doses to the PTV was agreed within 6% for more than 95% of the ACT cases. Volume changes of the ITV during the treatment course were observed to be minor in comparison of their standard deviations. Multiple linear regression analysis between the obtained parameters and the dose delivered to 95% volume of the PTV (D95%) revealed four PTV parameters [Warpmean, DSC, ΔHI between the PCT and ACT, Dmean] and the PTV D95% to be significantly related with P-values < 0.05. The ACT cases of high ΔHI were caused by higher values of the Warpmean and DSC from the deformable image registration, resulting in lower PTV D95% delivered. The mean values of PTV D95% and Warpmean showed significant differences depending on the lung lobe where the tumour was located. Conclusions Evaluation of the dose delivered to patients with NSCLC treated with SABR using ACTs confirmed that the prescribed dose was accurately delivered to the ITV. However, for the PTV, certain ACT cases characterised by high HI deviations from the original plan demonstrated variations in the delivered dose. These variations may potentially arise from factors such as patient setup during treatment, as suggested by the statistical analyses of the parameters obtained from the dose evaluation process.


Stereotactic ablative radiotherapy treatment for patients with non-small cell lung cancer
Stereotactic ablative radiotherapy (SABR), also known as stereotactic body radiation therapy (SBRT), is a radiotherapy procedure that is highly effective in controlling early stage primary or oligometastatic cancers.It delivers a higher biologically effective dose to the tumour than does conventional radiotherapy [1][2][3].Advancements in technologies, such as intensity-modulated radiation therapy (IMRT) and image-guided radiation therapy (IGRT) have helped achieve treatment goals by maintaining an acceptable therapeutic ratio with excellent dose conformity in SABR.An accurate patient setup facilitated by IGRT is a crucial aspect of SABR.Owing to advancements in IGRT, SABR has been applied to patients with early stage non-small cell lung cancer (NSCLC) who are medically inoperable or prefer non-invasive treatment [4][5][6][7][8].

Adaptive radiation therapy with CBCT
One of the most critical aspects of IGRT is the application of imaging techniques for precise patient positioning during treatment sessions.In external beam radiation therapy, cone-beam computed tomography (CBCT), which is integrated with a linear accelerator, is extensively used to position a patient based on the stance taken during treatment simulation.The CBCT image can provide information on the patient's treatment position, allowing estimation of the dose delivered to the gross tumour volume (GTV) and surrounding organs at risk (OARs).However, because of the known limitations of CBCT, such as the large uncertainty of the CT numbers arising from scattered photons [9][10][11] and several other effects [12][13][14], CBCT is considered inappropriate for direct dose computation [15].This limitation prevents CBCT from being used as a planned CT for adaptive radiation therapy when the patient's anatomy has been seriously deformed throughout the treatment course.
Advances in the DIR algorithm and CBCT image quality [29][30][31][32] have enabled the accumulation of the delivered dose using the daily CBCT [33,34] in addition to the efforts to overcome the limitations of CBCT.Based on the delivered dose estimation to the GTV and OARs, adaptive radiation therapy facilitates the modification of treatment plans to achieve optimal outcomes by adapting to the observed anatomic or physiologic variations in patients from the initial simulation.

Delivered dose estimation using deformable image registration
Synthetic CT, also called adaptive CT (ACT), can be acquired by aligning and deforming the PCT onto daily CBCT.This process involves both rigid and deformable image registrations.In SABR, CBCT acquired for each patient's setup at every fraction enables the creation of the ACT, allowing for the calculation of daily doses that reflect any variations from the initial simulation, and ultimately, the prediction of the total dose actually delivered.
Previous studies [35,36] have evaluated the dose delivered to patients with NSCLC using the same commercial software.They either focused on a limited number of PTV parameters or examined only a few treatment sessions.This study aimed to assess the dose delivered throughout the treatment course, reflecting the patient's anatomic and physiologic variations.Furthermore, maximum possible PTV parameters were collated through the dose evaluation process and were analysed to understand the relationship between the dose delivered to the PTV in each fraction and various parameters.This approach can help to identify the factors contributing to the observed dose distribution.were collected.The patients were treated with 60 Gy (D prescribed ) in four fractions between 2019 and 2023, using volumetric arc-modulated therapy of a 6 MV flattening filter-free photon beam of Varian TrueBeam STX (Varian Medical Systems Inc., Palo Alto, CA, USA).The D prescribed was prescribed to the 95% volume of the PTV.For lung SABR treatment planning, the four-dimensional CTs (4DCTs) obtained with thoracic scan protocol were reconstructed into 10 respiratory phases by Brilliance CT Big Bore™ (Royal Philips Electronics, Amsterdam, Netherlands), and then their average CT image was generated for the treatment plan.The maximum intensity projection (MIP) of the tumour from each phase was integrated and contoured as an ITV, and the PTV was determined by embracing the ITV on a patient-by-patient basis with a 5-7-mm margin, compensating for several treatment uncertainties.Treatment plans were created using a treatment planning system (TPS, Eclipse Ver.13.5 and 16.1 with Acuros XB algorithm, Varian Medical Systems Inc., Palo Alto, CA, USA).In the treatment room, the patients were positioned on the couch by matching the acquired daily CBCT image to the PCT image before treatment.The specifications of the PCT and CBCT images are listed in Table 1.The treatment records, including a set of Dicom RT plan (RP), RT dose (RD), RT structures (RS), the PCT, and daily CBCT images were exported into the Velocity software (Ver.4.1, Varian Medical Systems Inc., Palo Alto, CA, USA) to generate ACT.

Dose evaluation procedure
ACT generation began by aligning the PCT with CBCT.To estimate the dose delivered to the patient at the time of treatment, the couch location recorded by the CBCT matching to the PCT was used for rigid registration.The daily CBCT images were acquired with a free-breading.After applying rigid registration, deformable image registration of the PCT on CBCT was performed using the multipass B-spline algorithm [37,38] in the Velocity software.The RS, including the ITV, PTV, and normal structures, was automatically propagated following the deformable vector field (DVF) of the PCT to the CBCT to generate adaptive RS (aRS) on the ACT.The ACT and aRS were then transferred to the TPS to calculate the delivered dose (aRD), reflecting the daily variations from the initial simulation.After the dose calculation was completed in the TPS, the accompanying aRD with the ACT was copied back to the Velocity software to deform the aRD back to the PCT coordinate.Each patient's record contained four CBCTs, thus resulting in four ACTs and corresponding four aRDs.Therefore, the total dose delivered to the patient was estimated by averaging the four aRDs in the PCT coordinates.The workflow of the dose evaluation procedure is summarised in Fig. 1.

Data analysis with parameters
Through the dose evaluation procedure described earlier, parameters related to the ITV and PTV were collected, as summarised in Table 2.These parameters were obtained from the dose statistics of the RD and aRDs in the TPS, deformable quality assurance [39], comparison of the RS and aRS, and from the dose accumulation processes using Velocity software.Dose-evaluation indices, such as homogeneity index [40], were calculated and used in the analysis.Obtained parameters were assessed to determine their relationship to the treatment target, specifically the dose delivered to the 95% volume of the PTV (D 95% ), using a stepwise multiple linear regression model in SPSS® (Ver.27, IBM®, Chicago, IL, USA).As the study assessed the treatment records of 72 patients, a total of 288 ACT cases was analysed.Through the analysis, significant parameters that passed the normality test and multicollinearity check were selected, and their p-values were reported.Additionally, analysis of variance (ANOVA) was performed to examine the mean difference between the parameters depending on the tumour location.

Statistics of the parameters
After the multiple linear regression analysis, the PTV parameters proven to be significant were examined.First, basic histograms of the PTV parameters including PTV D 95% and two-dimensional distributions of the parameters were presented.Volumes of the ITV and PTV were reviewed in the PCT and ACTs, and the average volume differences between the ACTs and PCT were investigated.

Delivered dose evaluation
The dose delivered to the PTV and ITV was estimated from the aRDs and compared with the planned dose shown in Figs. 2 and 3, respectively.The planned and delivered doses were agreed within the uncertainty of the delivered doses.From a frequentist's statistical perspective, the delivered dose to the PTV agreed with the

Relations between the tumour location and PTV parameters
Each lung is divided into sections known as lobes that are closely associated with various structures within the thoracic cavity.The diaphragm, a dome-shaped primary respiratory muscle, is expected to undergo the most significant respiratory motion.Given that the diaphragm is located beneath the lower lung lobes, there has been specific interest in exploring the dependence of the values of PTV parameters, for example, the estimated delivered dose vs. tumour location.By assessing the mean values of the PTV parameters from 288 ACT cases, a significant difference was observed in PTV displacement and PTV D95% between the lobes (P-value less than 0.05) in Table 4.

Tumour volume change
Previous studies on NSCLC SABR treatment have reported changes in tumour volume, despite a short treatment duration of typically 7-12 days.Using various imaging modalities, some studies [41,42] reported a consistent decrease in GTV, whereas others [43][44][45] noted a slight initial increase in GTV during the treatment course, followed by a decrease.At our institute, treatment records document the ITV instead of the GTV; therefore, this study investigated the changes in the ITV  volume during the course of treatment.Considering the average volume differences per fraction for both the ITV and PTV are nearly zero and significantly smaller than the SD in Fig. 4(c) and (d), volume changes during the SABR treatment did not constitute a significant concern in this study.
In the ACTs, the ITVs and PTVs were automatically recontoured following the DVF obtained from the DIR using commercial software, and their volumetric differences were consistently negative.These plots indicate that the recontoured structures exhibit a marginally smaller volume than their original volume, and the volume difference may originate from the differences in the method of treating the respiratory motion between PCT and CBCT.The ITV in the PCT was delineated based on the MIP of the GTV, whereas the CBCT was acquired with free-breathing.This might have affected the ITV in the ACT, reflecting that the volumes recorded in the free-breathing CBCT are marginally smaller than the ITV and PTV in the PCT.However, this volume difference was significantly smaller than the SD.Comparing the values in Fig. 4(c) and (d), the average ITV differences are approximately twice as large as those of the PTV.These discrepancies might have occurred because the ITV, when used as the denominator to calculate the volume difference, is smaller than the PTV.

Recontoured structures for dose evaluation
Concerns may arise regarding the accuracy of the aRS recontoured automatically using the commercial software.When applying the ACT for adaptive planning, it is essential for a radiation oncologist to review and correct the delineation of the ITV and PTV prior to treatment approval.Nonetheless, the automatically recontoured ITV and PTV on ACT, based on the DVF, were considered adequate for evaluating the dose delivered to the patients.This study was performed to investigate how the original PTV was deformed in the treatment and whether  the planned dose was actually delivered to the original PTV.Among the collected patient data, the ITV and PTV delineations were randomly selected and reviewed by an expert for comparison with manual recontouring.This comparison revealed no significant differences in dosimetric evaluation, thus supporting the use of automatically recontoured structures for dose evaluation across all 288 ACT cases.

PTV parameters depending on the tumour location
The PTV located in the lower lobes showed a greater discrepancy in the position between the treatment and simulation.The mean Warp mean value was notably higher and the mean D 95% value was significantly lower for tumours located in the lower lobes compared with those of tumours in the upper lobes (Table 4).Furthermore, the SDs for PTVs in the lower lobes exhibited a significant increase, indicating that tumours located in the upper lobes allow for more consistent patient setup reproducibility, thereby enhancing the accuracy of the prescribed dose delivery to the PTV.Among the seven outliers, six had PTV located in the lower lobes.This may be attributed to the observed higher Warp mean and lower D 95% values in the lower lobes, a trend that these outliers similarly exhibited.

Outliers of the multiple linear regression model prediction
Seven outliers marked in Figs. 5 and 6 are characterised by having the highest ΔHI and small PTV volume.These outliers show significant deviations from linearity, as shown in Fig. 8.The PTV D 95% values of the outliers calculated from the ACTs in the TPS deviated from the PTV D 95% predictions of the multiple linear regression.Two geometric and two dosimetric parameters were found to be significantly related to the D 95% in the multiple linear regression, and the detailed results are listed in the Table 3.
To evaluate the effect of ACTs with a high ΔHI, stepwise multiple linear regression was repeated, excluding ACT cases of HI > 17, based on the distribution in Fig. 5(b).The results summarised in Table 5 indicate that only the dosimetric parameters maintained a significant correlation with D 95% .This indicates that the outliers may be closely linked to the two geometric parameters that represent the PTV displacement.Such displacement might imply uncertainties from the patient setup and respiratory motion, resulting in dosimetric uncertainty.Kim et al. [46] highlighted that geometric uncertainties in patient positioning can limit the clinical advantages of IMRT.Therefore, during SABR treatment, a more thorough consideration of patient setup and respiratory motion is imperative.
Although these cases exhibited suboptimal dosimetric outcomes for the PTV, Fig. 3 confirms that the dose was accurately delivered to the ITV.This demonstrates that the margin between the ITV and PTV effectively ensures sufficient dose coverage of the ITV.As the patients were treated under the prescription of one radiation oncologist, the ITV and PTV were consistently determined.However, it is worth studying whether the margin can be further reduced with respect to various parameters obtained.In this study, analysis of multiple parameters enabled a more precise evaluation of the dose delivered.

Uncertainties affecting the deformed image
In DIR, the PCT is a moving image, while the daily CBCT is a stationary image.When the PCT is deformed to the daily CBCT, the deformed PCT is called an ACT, retaining patient information at the time of the treatment setup.In the TPS dose calculation, the delivered dose based on the ACT was calculated using the original beam plan.Thus, we believe that the setup error implied by the ACT affects the estimated delivered dose.
Although the QA parameters of DIR [39] were assessed and included in the analysis (Table 2), the uncertainty  of the DIR algorithm was not considered.Repeating the DIR using the same CBCT and PCT images is insufficient to estimate the DIR uncertainty; however, the DIR uncertainty is expected to be systematic without giving a rise to an outlier.As the outliers of interest occurred randomly among the 288 ACT cases, they likely originated from a random source, such as the setup error rather than the DIR algorithm error.

Limitations of this study
One limitation is the resolution of the parameters obtained by comparing the images.The image resolutions of the PCT and daily CBCT are presented in Table 1.Recalling the method of generating the ACT, it is a result of the deformation of the PCT to the daily CBCT, thus the resolution of the ACT is same as that of the PCT; the transverse and vertical resolutions of the ACT were approximately 1.3 and 3 mm, respectively.Considering the Warp mean , which showed a peak around 1.3 mm (Fig. 7b), it is challenging to calculate any finer displacement.However, we used the average value of the displacement calculated from PTV voxels; thus, we did not rely on a single movement value but on the movement trend of the PTV structure.Secondly, a methodology covering intra-fractional motion during SABR was lacking.Monitoring the intrafractional motion might be optimal for IMRT treatment using an MR-Linac.In the CBCT-Linac option, it is difficult to track real-time respiratory motion during treatment.In particular, for SABR, respiratory motioncontrolled treatments, such as DIBH, are not normally considered.Although we observed one case of using continuous positive airway pressure breathing, the respiratory motion was not monitored.
Intra-fractional motion was considered as the respiratory motion in the PCT and ACT images, and the patients were subjected to treatment with a free-breathing.In PCT, the respiratory motion was assessed using the MIP of the tumour.As we did not control the patient's respiration during CBCT acquisition, which took approximately 1 min, the image was expected to represent the average motion of the GTV.Surrounding OARs were also obtained with average intensity projection (AIP) on PCT and with the free-breathing on CBCT.

Comparison of the result with previous studies
Previous studies utilised the same commercial software to evaluate the dose delivered to patients with NSCLC.Czajkowski P et al. [35] evaluated the accuracy of dose delivery in the stereotactic radiation therapy for both brain and lung cancers.Although they analysed only 10 patients for the lung SABR dose evaluation, they reported no significant change on the ITV volume during the treatment and agreement within ± 10% on the dose in 99% volume of the PTV between the PCT and ACT.They assessed only the PTV volume and DSC regarding the PTV change.On the contrary, Wang B et al. [36] investigated the differences of delivered and planned dose to PTV as well as to the OARs, which were clinically acceptable.They used records of 27 patients with locally advanced NSCLC who were treated with 51 Gy in 17 fractions.They generated ACTs for treatment in 1,5,9,13, and 17.A significant tumour shrinkage (11.1%) was observed through the course.However, no significant difference was discovered in the volume of 51 Gy isodose line corresponding to the PTV, but limited increase (< 5%) was observed in total lung, oesophagus, and heart.
Compared with previous studies, we calculated delivered dose for every fraction and assessed the PTV parameters as much as possible with an increased number of patients.We were able to obtain distributions of the significant PTV parameters and the parameters were proven to be related to the PTV D 95% .Besides the delivered dose evaluation, the PTV D 95% and the PTV displacement from the PCT were observed significantly related to the tumour location.

Conclusions
Throughout the evaluation of the delivered dose, we confirmed that the prescribed dose was successfully delivered to the ITV.The analysis showed that the HI difference between the ACT and PCT was the most sensitive parameter for the delivered PTV D 95% .Although the dose was delivered to the PTV successfully in most cases, a few outliers with higher ΔHI, that degraded the PTV dose distribution, were observed.Judging by the relationship between geometric parameters of the PTV and the worse PTV D 95% , these outliers might be caused by the misaligned patient setup.Analysis of the delivered dose along with the parameters obtained during the evaluation process demonstrated that the PTV margin effectively compensated for the setup uncertainty.

Fig. 2 Fig. 1
Fig. 2 Dose comparison between planned (filled square) and estimated delivered (open square) dose to planning target volume (PTV) and their standard deviations: minimum (D min ), mean (D mean ), maximum dose (D max ) to the PTV, and the dose delivered to the 95% volume of the PTV (D 95% )

Fig. 3
Fig. 3 Dose comparison between planned (filled square) and estimated delivered (open square) dose to internal target volume (ITV) and their standard deviations: minimum (D min ), mean (D mean ), maximum (D max ) dose delivered to the ITV

Fig. 6 Fig. 5
Fig. 6 (a) One dimensional distribution of the PTV volume (cc) and (b) scattered plot of the delivered dose to 95% volume of the PTV (D 95% ) vs. the PTV volume.Concerned outlier cases were highlighted in the scatter plot

Fig. 7
Fig. 7 One-dimensional distribution of the independent parameters: (a) mean delivered dose to PTV (D mean ), (b) mean Warping distance (Warp mean ), (c) Dice coefficient of similarity (DSC), and a dependent parameter (d) delivered dose to 95% volume of the PTV (D 95% )

Fig. 8
Fig. 8 Scattered plot of the residual of the dose prediction vs. the predicted valued of the regression model.Concerned outliers were highlighted in the scatter plot

Table 1
Specifications of the treatment simulation CT and cone beam CT

Table 2
List of parameters collected for internal and planning target volume

Table 3
Results of the multiple linear regression *VIF: Variance Inflation Factor, Tolerance: a measure of collinearity, ΔHI: difference of homogeneity index, D mean : mean delivered dose to PTV, DSC: Dice Similarity Coefficient, Warp mean : mean displacement of PTV

Table 4
Means and standard deviations of the D 95% , DSC, and Warp mean distributions of the PTV depending on the tumour location in the lung lobe.Meaningful differences were assessed by the P-value

Table 5
Results of the multiple linear regression by SPSS® (v27, IBM®, Chicago, IL, USA) with cases having ΔHI less than 17 *VIF: Variance Inflation Factor, Tolerance: a measure of collinearity, ΔHI: difference of homogeneity index, D mean : mean delivered dose to PTV